p = 2 or 1 (fractionally spaced or symbol spaced) ----> 2 is better

scaling the soft values -----> scaling is better

update rate (0.99 or 0.98 or 0.95) ------> 0.99 is the best

Number of taps (relative to delay spread) ------> 14 and 4 are the best for a delay spread of one symbol,
but for a delay spread of 5 symbols, maybe 20 and 6

Taking the symbol from the last tap is more correct than taking it from the middle tap (i.e. the filter output is taken when the desired symbol is at the last tap, not the middle one).
Initialization of the filter taps (one at the desired tap and zeros at the others) matters somewhat when p=1 (symbol spaced), but for fractional spacing it has only a small effect.
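
A minimal sketch of the single-spike initialization and the choice of desired tap noted above, assuming a plain FIR filter with real-valued samples. The names init_taps and filter_output, the 4-tap length, and the sample values are hypothetical and only for illustration; they are not taken from the original experiments, and no adaptation step is shown.

```python
def init_taps(num_taps, desired_tap):
    """Return a tap vector with 1.0 at desired_tap and 0.0 at every other tap."""
    if not 0 <= desired_tap < num_taps:
        raise ValueError("desired_tap must be a valid tap index")
    taps = [0.0] * num_taps
    taps[desired_tap] = 1.0
    return taps


def filter_output(taps, samples, n):
    """FIR output at index n: sum over k of taps[k] * samples[n - k].

    Samples before the start of the sequence are treated as zero.
    """
    return sum(
        w * samples[n - k]
        for k, w in enumerate(taps)
        if 0 <= n - k < len(samples)
    )


# Hypothetical received samples (one sample per symbol, i.e. p = 1).
samples = [1.0, -1.0, 1.0, 1.0, -1.0]

# Desired symbol at the last tap: the output at n lags the input by num_taps - 1.
taps_last = init_taps(4, desired_tap=3)
out_last = filter_output(taps_last, samples, 3)

# Desired symbol at the middle tap: the output at n lags the input by 2 instead.
taps_mid = init_taps(4, desired_tap=2)
out_mid = filter_output(taps_mid, samples, 3)
```

Before any adaptation, the spike position alone fixes the delay between input and output: out_last is samples[0] and out_mid is samples[1]. This is why the choice of desired tap and the initialization are tied together.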
